feat(dataframe): add writeJson with JsonWriteOptions by LantaoJin · Pull Request #61 · apache/datafusion-java

LantaoJin · 2026-05-18T10:11:32Z

Which issue does this PR close?

Closes feat: add DataFrame.writeJson with JsonWriteOptions #39 .

Rationale for this change

DataFrame.writeParquet shipped in #27. JSON is the third writer DataFusion's DataFrame API exposes natively (DataFrame::write_json) and is the easiest format to consume from non-Arrow downstream tooling. The implementation follows the same proto-over-JNI pattern as the merged readers, mirrors the writer-side shape we'd land for CSV (#38), and has zero binary-size impact -- DataFusion's JSON support is in the default feature set, no Cargo flag changes required.

What changes are included in this PR?

proto/json_write_options.proto -- new JsonWriteOptionsProto message
JsonWriteOptions Java builder
Java_org_apache_datafusion_DataFrame_writeJsonWithOptions JNI handler in native/src/json.rs

Are these changes tested?

Yes -- 9 new tests across JsonWriteOptionsTest and DataFrameWriteJsonTest.

Are there any user-facing changes?

Yes -- purely additive. New public API:

org.apache.datafusion.JsonWriteOptions
DataFrame.writeJson(String)
DataFrame.writeJson(String, JsonWriteOptions)

The new org.apache.datafusion.protobuf.JsonWriteOptionsProto generated class is also exposed via the protobuf-Java output, consistent with how CsvReadOptionsProto, NdJsonReadOptionsProto, etc. are exposed. No API removals, no deprecations, no behavior change for existing callers. No Cargo feature changes; binary size is unchanged.

Mirror writeParquet's surface for newline-delimited JSON. JsonWriteOptions exposes singleFileOutput, partitionCols, and fileCompressionType; the DataFusion-side JsonOptions only carries compression in writer mode (the read-side toggles like newline_delimited and schema_infer_max_rec do not apply here). JsonOptions has no fluent setters, so the native handler builds it via struct-update syntax (same idiom as ArrowReadOptions / AvroReadOptions). Option<JsonOptions> stays None when no writer-side knob is set, so DataFusion's runtime defaults are preserved when callers pass new JsonWriteOptions(). When the caller leaves singleFileOutput unset, default to directory output (with_single_file_output(false)) rather than DataFusion's Automatic mode. Automatic treats extension-bearing paths like "out.json" as single-file targets, which would silently contradict the documented "directory unless overridden" default.

andygrove · 2026-05-18T23:28:22Z

@LantaoJin could you fix conflict? Thanks

…e-json # Conflicts: # core/src/main/java/org/apache/datafusion/DataFrame.java

LantaoJin · 2026-05-19T02:32:37Z

@LantaoJin could you fix conflict? Thanks

Done

Merge remote-tracking branch 'upstream/main' into feat/dataframe-writ…

31452b5

…e-json # Conflicts: # core/src/main/java/org/apache/datafusion/DataFrame.java

andygrove merged commit 2ecf3f1 into apache:main May 19, 2026
1 check passed

LantaoJin deleted the feat/dataframe-write-json branch May 20, 2026 00:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(dataframe): add writeJson with JsonWriteOptions#61

feat(dataframe): add writeJson with JsonWriteOptions#61
andygrove merged 2 commits into
apache:mainfrom
LantaoJin:feat/dataframe-write-json

LantaoJin commented May 18, 2026 •

edited

Loading

Uh oh!

andygrove commented May 18, 2026

Uh oh!

LantaoJin commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LantaoJin commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

andygrove commented May 18, 2026

Uh oh!

LantaoJin commented May 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

LantaoJin commented May 18, 2026 •

edited

Loading